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ABSTRACT 

This document, produced through an ESRA Title III 
grant, contains seven position papers on education evaluation from 
the standpoints of psychology, sociology, political science, business 
management, school administration, and computer science. The first 
paper summarizes the history of educational program evaluation and 
reviews current theories and techniques. The second discusses the 
implications of viewing evaluation as equivalent to, or a special 
case of, behavioral-science research. The third is concerned with 
subcultures of the school, self-actualization, and student 
evaluation,. Evaluation of school-control is discussed in the fourth 
paper, with emphasis on dominant-submissive relationships which are 
customary in bureaucracies. The administrative function in evaluation 
is discussed in the fifth. The last two papers discuss program 
evaluation and the quantification of education for planning 
applications that use models for evaluation. An 87-entry bibliography 
is appended. (DE) 
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PREFACE 



The Interdisciplinary Committee on Education Evaluation was orga- 
nized to explore the parameters of the evaluative function included in the 
charter of the Gulf Schools Supplementary Education Center (P.L. 89 -IO, 

Title III). The first stage of the committee's work has been to prepare 
position papers on education evaluation from the standpoint of psychology, 
sociology, political science, business management, school administration, 
and computer science. 

Preliminary to the committee discussion, a study cf the develop- 
ment of theory and practice in education evaluation and a bibliography 
were prepared by Dr. Gene Atkinson, Associate Professor in the Bureau of 
Education Research and Services, University of Houston, assisted by David 
A. Knickel. The position papers included in this report were produced by 
individuals on the committee and agreed to in a series of committee meetings. 

The composition of the committee was designed to represent the view- 
points of different academic disciplines and institutions. It was thought 
that a task as complex and far-reaching as education evaluation should be 
of concern in many fields of endeavor. Accordingly, the committee was 
comprised of the following specialists: 

Dr. Edwin P. Willems, Department of Psychology, Rice University 

Dr. Charles Mulford, Department of Sociology, University of Houston 

Dr. Bancroft C. Henderson, Department of Political Science, Univer- 
sity of Houston 

Mr. Troy A. Womack, Vice-President for Business Affairs, Houston 
Baptist College 

Dr. Glen Self, Department of Industrial Engineering, Texas A & M 
University 

Dr. Wallace H. Strevell, Department of Administrative Education, 
University of Houston - Chairman 

The collection of papers should be read with appreciation of the 
contribution that each discipline has offered, since each author attempted 
to identify the research of his field that favored on education evaluation. 
The exercise, however, has had an essentially pragmatic purpose. We believe 
that guidelines have been stated for field testing a practical plan of 
evaluation. 



April 15, 1967 



W. H. Strevell, Chairman 
Interdisciplinary Committee on 
Education Evaluation 



EVALUATION OF EDUCATIONAL PROGRAMS : AN EXPLORATION 

Gene Atkinson, EdD* 
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* . .evaluation has become a sort of sacred cow in 
education, in the sense that everyone, both educators and 
laymen, theoretically believes in it. Not many people know 
what it is or what to do with it, but it is nevertheless 
widely regarded as "a good thing." The vague notion is that 
we can find out through some procedure (scientific or other- 
wise) what we are accomplishing with all these federal dol- 
lars sper.it on education. 

— Harold Howe II, U. S. 

Commissioner of Education, December, 1966^ 

The above remark, introducing the U. S. Commissioner's views on 
evaluation while summarizing recent activities of his office, might well 
suggest caution to all who wish to evaluate educational programs. Whether 
or not "vague notions" and "procedures (scientific or otherwise)" in 
evaluation have in the past retarded educational progress, future practice 
requires the investment of the best available talent and techniques to 
insure maximum achievement of desirable objectives. 

This report will briefly summarize the history of evaluation of 
educational programs and review current theories and techniques. Designed 
to give an introductory background to individuals interested in establish- 
ing or executing programs for educational evaluation, the report will 
include definitions and purposes of evaluation and will attempt to pin- 
point major authors for those wishing to pursue specific topics in detail. 



Definitions 



Dressel states: "Evaluation involves judging the worth of an 

experience, idea, or process. The judgment presupposes standards or 



*Associate Professor of Administrative Education, University of Houston, 
assisted by David A. Knickel, Graduate Research Assistant. 

^Harold Howe II, "Growth and Growing Pains," Saturday Review , 
ij-9: 68=70 1 Qlf December 17, 1966. 
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criteria .” 2 Shane's 1950 study showed five common interpretations by 
educators of the term "evaluation”: (l) as the application of value to a 

problem (e.g., "criteria for evaluating books”). ( 2 ) as a synonym for 
measurement (testing), ( 3 ) as a label for processes used in gauging teacher 
resources, and ( 5 ) as a -study of changes in children's behavior. 3 Only 
the latter two interpretations are considered here; both are incorporated 
in pier's "Basic Notions Regarding Evaluation": 



The process of evaluation is essentially the process 
of determining to what extent the educational objectives are 
actually being realized by the program of curriculum and 
instruction. However, since educational objectives are essen- 
tially changes in human beings, that is, the objectives aimed 
at are to produce certain desirable changes in the behavior 
patterns of the student, then evaluation is the process for 
determining the degree to which these changes in behavior are 
actually taking place 











Travers, however, points out that evaluation is more than merely 
determining the actual outcomes (i.e., changes produced in the students); 
evaluation also involves a judgment of the desirability of whatever out- 
comes are demonstrated to occur. 5 



History 



Evaluation of education has a history as long as that of education 
itself. Current systems of inquiry and analysis in the field share common 
roots, of course, with other fields of modern knowledge; the roots have 
been well traced in works on the history of modern science. 

Experimental science has made phenomenal advances from the time of 
Galileo and Newton, but techniques for measuring and appraising human 
behavior became common only in the latter part of the nineteenth century. 



O 

Paul L. Dressel and Associates, Evaluation in Higher Education 
(Cambridge: Riverside Press, 1961), p. 6. 

^Harold G. Shane and E. T. McSwain, Evaluation and the Elementary 
Curriculum (New York: Henry Holt and Company, 1951), pp. 5 535. 

^Ralph W. Tyler, Basic Principles of Curricu lum and Instruction 
(Chicago: University of Chicago Press, 1950), p. 69. 

^Robert M. W. Travers, Educational Measurement (New York: MacMillan 
Company, 1955), p. 7. 
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Three branches of the stream of science combined "to yield the vigorous 
measurement movement in psychology and its spread through education. . . 
These vere (l) the physiological and experimental psychology that had its 
main growth in Germany in the nineteenth century, (2) Darwinian biology, 
and (3) the clinical concern for the maladjusted and underdeveloped indivi- 
dual . Newly available quantitative data about humans and techniques for 
analyzing the data influenced research orientations not only in the sub- 
areas of psychology and education but in the other behavioral sciences as 
well. Although educators had appraised students and programs as best 
they could (generally by observation of student responses), new approaches 
were forthcoming; the measurement movement and new philosophical insights 
brought "wedges of inquiry and experimentation" into the thinking of the 
educational profession at the beginning of the twentieth century. 7 

One must note, in passing, the serious indictments by Callahan in 
Education and the Cult of Efficiency . Callahan argues persuasively, and 
with considerable evidence, that many of the school "evaluations" since 
1900 have been less products of "inquiry and experimentation" than capi- 
tulation by educational leaders to pressures by business-industrial groups 
demanding lower costs and greater efficiencies," considering the schools 
basically as factories. Among the individuals receiving criticism were 
(alphabetically) Leonard Ayres, Franklin Bobbitt, Ellwood P. Cubberly, 

Fred Englehardt, Paul Hanus, Henry H. Linn, Frank Spaulding, and George 
Strayer. However, Callahan’s accusation appears to have been directed less 
toward evaluative processes in general and more toward poor choices of 
objectives and criteria. 

Thomas has recently presented an input-output model of efficiency 
in education which attempts to meet Callahan's criticism by emphasizing 
that an efficient operation is "one which produces a maximum amount or 
level of the desired product, for given input of resources," with product 
defined broadly in terms of desirable educational goals. 9 

Since 1900, several major and hundreds of minor evaluative projects 
have been completed, many of these being oriented toward values other than 



^Robert L. Thorndike and Elizabeth Hagen, Measurement and Evaluation 
in Psychology and Education (New York: John Wiley & Sons, Inc., 19&1), p. 3* 

^William Clark Trow, S cientific Method in Education (Boston: Houghton 
Mifflin Company, 1925), Introduction. 

^Raymond E. Callahan, Education and the Cult of Efficiency (Chicago: 
University of Chicago Press, 19&2 ) . ~ 

9j. Alan Thomas, "Efficiency in Education" (Chicago: American Edu- 
cation Research Association Meeting, February 15, 1963)- (Mimeographed.) 




economic efficiency (e.g., "quality" of education, individualized instruct- 
ion, etc . ) . Accreditation of secondary schools by regional groups of 
colleges began in about 1900 (the College Entrance Examination Board was 
established in 1901).-^ The "Eight-Year Study" of the Progressive Education 
Association's Commission on the Relation of School and College utilized an 
elaborate and sophisticated evaluation program, with Ralph Tyler as Research 
Director .!! 

The Pennsylvania Study, conducted from about 1928-38 by Ben D. Wood 
and William S. Learned, set out "to review the current system of schooling 
in the light of objective evidence concerning its obtained results." Unfor- 
tunately, the especially developed "comprehensive" achievement tests appar- 
ently measured superficial knowledge of widely scattered facts rather than 
deep knowledge and thought processes. (Later Wood, working with the Ameri- 
can Council on Education established the Cooperative Test Service while 
Learned, through the Carnegie Foundation, established the Graduate Record 
Examination. 

The National Education Association's Committee on Tax Education and 
School finance cites several dozen studies which generally tend to support 
the Committee's conclusion that "better education does cost more." The 
report itself is a useful bibliographical guide. !3 

In 1.956, the New York State Education Department created the Quality 
Measurement Project which began with a four-year exploratory study of one 
hundred school systems and 70,000 students, and has published a School 
Quality Workbook describing the current evaluative process in that state. I 1 * - 

Pennsylvania's General Assembly, in the School District Reorgani- 
zation Act of 1963 ,» required the State Board of Education to develop and 
to implement a plan for evaluating the quality of educational programs in 
the Commonwealth. Details of a proposed plan were submitted in a three- 



!Q Evaluati ve Criteria , i960 Edition (Washington: National Study of 
Secondary School Evaluation, i960), p. 3. 

■^Eugene R. Smith, Ralph W. Tyler and the Evaluation Staff, 

Appraising and Recording Student Progress (New York: Harper & Brothers, 19^2). 

l^Travers, £P* > PP* 79-82. 

!3uational Education Association Committee on Tax Education and 
School Finance, Does Better Education Cost More ? (Washington: The Asso- 
ciation, 1959)* 

^William D. Firman, "The Quality Measurement Project in New York 
State," Science Education, 50 °. 239 - 19 , April, 1966. 
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volume report in 1965*^ 

Widespread interest in and concern with educational evaluation has 
been prompted recently by the Elementary and Secondary Education Act of 
1965; statutory provisions require evaluation of all projects funded under 
provisions of this law. 1-6 The Congressional expectation of prompt and 
periodic evaluation of results elicited Howe's remarks cited earlier. IT 

Two nationwide evaluation programs which appear to be imminent are 
bringing about both improved thinking and heightened emotional reaction. 

The project, "Assessing the Progress of Education" (also known as the 
National Assessment Program), with Ralph Tyler as Chairman of the Explora- 
tory Committee, aims to gather comprehensive, dependable, and valid data 
from which an index of educational progress, similar to the Gross National 
Product Index, can be derived. The educational progress index would afford 
a guide for the allocation of additional resources. Pilot studies were 
scheduled for 1966, with inauguration of the full project to come in the 
fall of 1967. A separate proposal is the "projected cost-effectiveness 
study of the U. S. Office of Education planned to involve some 300,00 
preschool children to college students in the fall of 1968. . . Ml 9 

Institutions of higher education have continually produced research- 
oriented evaluative studies, generally of themselves. The Cooperative 
Study of Evaluation in General Education of the American Council on Edu- 
cation sponsored, development of techniques and measuring devices for col- 
lege use. 20 



15Henry S. Dyer, "The Pennsylvania Plan, " Scienc e Education , 50: 
242-8, April, 1966. 

l6"How to Evaluate Federal Programs, " Nation's Schools , 77:49-71, 
May, 1966. 

^Harold Howe II, "Growth and Growing Pains,” Saturday Review , 
14.9:68-70, 87, December 17, 19^6. 

l^Ralph W. Tyler, "Assessing the Progress of Education, " Science 
Education , 50:239-42, April, 1966. 

1 9j. Raymond Gerberich, "Assessment: A Forward Look," Educational 

Leadership, 24: 115-8, November, 1966. 

20 Paul L. Dressel and Lewis B. Mayhew, General Education : Explora- 

tions in Evaluation (Washington: American Council on Education, ±9Wh 




Hagen and Thorndike noted in 1957: 
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. . .since World War II the focus of interest and 
effort appears to have shifted away from a research oriented 



attempt to develop new and tetter evaluation procedures, to 
an action research oriented attempt to involve school per- 
sonnel in evaluating their own programs. . .This type of self- 
evaluation is defended not in terms of the technical excel- 
lence of the appraisal that will result, hut on the ground 
that the evaluation experience will he a valuable learning 
experience for the teachers that engage in it. 21 



Recent literature, however, appears to indicate a strong resurgence of the 
research orientation. 



Theory and Techniques 



Evaluation involves "three distinct aspects: (a) selecting the 

attributes that are important for judging the worth of a specimen to he 
evaluated, (h) developing and applying procedures that will describe these 
attributes truly and accurately, and (c) synthesizing the evidence yielded 
by these procedures into a final judgment of worth. "22 Restated, these 
aspects become (a) identifying objectives, (b) developing and applying 
measuring instruments, and (c) interpreting results. 



I 
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Objectives to be measured may be drawn from three domains, desig- 
nated as structure, process, and product. 23 "Structure" includes such items 
as administrative data, school plant, and formal organization,, Studies of 
financial support of schools, for instance, are of this type and tend to 
assume high correlation with desired outcomes. "Process" refers to some 
aspect of school or classroom procedure. Ryan reports a small research 
study of this type at the college level; Mort’s Growing Edge is considered 
a process-oriented action research procedure for evaluating publis schools. 
Critics maintain that this type of study, too, generally can only assume 



M 



^-Elizabeth P. Hagen and Robert L. Thorndike, "Evaluation," Ency - 
clopedia of Educational Research (3rd ed.), pp. 482-6. 



22ibid. 



23lbid. 
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high correlation with desired outcomes. 24,25,26 "product’ 1 refers to the 
ultimate exhibited performance or behavior of students. The Eight- Year 
Study, the Quality Measurement Project in New York State, the current 
Pennsylvania Plan, and the proposed ’’National Assessment Program” represent 
product-oriented attempts at evaluation. 

Selection of objectives to be measured depends on the judgment of 
the evaluator. Sources of objectives for evaluation are similar to and as 
varied as the sources of objectives for curriculum, discussed in standard 
works in the field of curriculum development. 

Dressel points out that educational objectives may be examined 
relative to several ccntinua: simple — complex, achievable — unachievable, 

explicit— implicit 
goals for society. 

The Pennsylvania Plan has selected "ten major goals for education in 
Pennsylvania” as qualities to be measured; attempts are being made to trans- 
late these goals into measurable terms of student performance. The goals 
say that quality education 

. . .should help every child acquire (l) self- 
understanding and self-acceptance, (2) understanding and 
appreciation of social, cultural, and ethnic groups differ- 
ent from his own, (3) mastery of the use of words and num- 
bers, (4) positive attitudes toward school and learning, 

(5) the habits and values associated with responsible 
citizenship, (6) good health habits and knowledge of the 
conditions necessary for maintaining physical and emotional 
well-being, (7) experience with creative processes in a 
variety of fields, (8) a full grasp of the opportunties 
for preparing for a productive life, (9) understanding and 
appreciation of the natural sciences, the social sciences, 
the humanities, and the arts, and (10) preparation for 



£7 



intrinsic — transcendental, and goals for individual — 



^Antoinette T. Ryan, ’’Testing Instructional Approaches for Increased 
Learning,” Phi Delta Kappa > 66 : 524 - 6 , June, 19 ^ 5 * 

25p a ul R. Mort, W. S. Vincent, and C. A. Newell, The Growing Edge 
(New York: Metropolitan School Study Council, 1957 )* 

2 %agen and Thorndike, loc. cit. 



2 7paul L. Dressel, "Measurement and Evaluation,” Thirteenth Yearbook 
(i960) of American Association of College for Teacher Education (Wash- 
ington: The Association, I960), pp. 45-52* 
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effective participation in a changing world. 28 

These goals, of course, resemble others set forth by various groups in the 
past. 



Basically, instruments and procedures used to measure objectives 
are limited only by the ingenuity of the evaluator and the availability of 
time and money. The use of tests of all types is well known; aptitudes, 
attitudes, and achievements are regularly measured with more or less valid- 
ity. Even while old devices are revised toward improvement, new ones are 
being introduced. Webb, et al. , suggest an increased use of observation, 
review of archives (i. e., public records), and other techniques of which 
subjects would be unaware, to supplement the more traditional sources of 
information. *9 

Accreditation of schools, by use of self-evaluation procedures and 
visitation teams, is also a familiar process. School surveys are commonly 
used to evaluate special problem areas of school systems. 

Appropriate research designs are obviously essential in evaluation. 

A helpful summary. of several different types is given by Flanagan, who 
illustrates briefly with such examples as Project TALENT. 30 

The increasing availability of electronic data processing equipment 
and skills makes possible the collection and analysis of larger quantities 
of information than could have been effectively utilized in the past. 
Effectiveness of new programs, for example, can now more easily be evaluated 
by keeping track of individual students and their curricular experiences 
from a longitudinal point of view. 31 

If educators can remain "product-oriented" toward desirable goals, 
benefits in the form of improved evaluation may result from the introduction 
of new techniques developed primarily in the defense and aerospace indus- 
tries and in the federal government. Systems analysis is now being uti- 
lized as an approach to both large- and small-scale educational problems. 



2®Dyer, loc . cit. 

29Eugene J. Webb, et al., Unobtrusive Measure s: Nonre active Research 

in the Social Sciences (Chicago: Rand McNally, 1966). 

30john C. Flanagan, "Evaluating Educational Outcomes," Science 
Education , 50:2^8-51, April, 19 66. “ 

31 .Alvin Grossman and Robert L. Howe, Data Processing for Educators 
(Chicago: Educational Methods, Inc., 1965), p. 251. 
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Applications 

used. 32 All 

of the* newer 
in operation 



of PERT in the field of education are now being developed and 
of these techniques will probably be required if the demands 
concepts of program budgeting and cost-effectiveness (already 
at the federal level) are to be met by local school systems.- 3 



Conclusion 



An effort has been made to provide sufficient background information 
to enable interested readers to pursue futher topics of their choice in 
the broad and dynamically expanding field of educational evaluation. 
Although evaluation is admittedly difficult, guideposts offered in this 
•text and bibliography should be of some use to educators vitally interested 
in the evaluative process. 



32pesmond L. Cook, An Introduction to PERT (Columbus, Ohio: Eureau 
of Educational Research and Services, Ohio State University, 196^-) • 

33werner Z. Hisch, "Education in the Program Budget," Progr am 
Budgeting (David Novick, ed.), (Washington: U. S. Government Printing 
Office, 1965 ), Chap. 6, pp. 13I-5b. 
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IMPLICATIONS OF VIEWING EDUCATIONAL EVALUATION 
AS RESEARCH IN THE BEHAVIORAL SCIENCES 



Edwin P. Willems* 



** * 



Under a flood of public financial support and pressure toward inno- 
vation, experimentation, and change, resulting in a concern about short- 
term and long-range effects of unusual support and change, evaluation has 
become a distinctive password in education circles. As with many pass- 
words, it is easy to hope for too much from evaluation, and to infuse it 
with magic and power that it does not have. There should be some advantage 
in stripping the concept of its unintended and undeserved attributes and 
preserving its more realistic and workable ones. 



First, evaluation says nothing about what the goals of education 
ought to be. The generating and articulating of goals, hopes, and ideals 
come from community consensus and society's norms. Rather, evaluation is 
a process of gathering evidence that will form the basis for the most accu- 
rate statements possible about particular institutions, strategies, sequences, 
and styles of educational practice. While the resulting statements may 
focus on the extent to which goals and hopes are realized, evaluation itself 
is the process by which one uses the best skills and techniques available 
to make reasonable statements, that is, to answer questions about evaluation. 



Second, evaluation as such is neutral as to the level at which the 
questions are asked and answers sought. At one time, the question might 
be, "To what extent does school system A meet certain standards of economic 
efficiency?" At another time, one might ask, "Is teaching style Z enhancing 
the reading speed of intellectually marginal students?" Futherraore, evalu- 
ation as such is neutral as to the time perspective of the questions asked. 
That is, one might wonder about a portion of a particular class session or 
about lifelong effects. 



Viewing evaluation as the process by which persons use the best skills 
and means available to answer questions makes it equivalent to another set 
of human activities called research. Research has also become a password, 



and as a result, there are varying opinions about its power and usefulness. 
It is also common to infuse research with too much magic and power, and to 
circumscribe it with overly strict boundaries. For example, Kerlinger 



^Assistant Professor of Psychology, Rice University, 



^Gene Atkinson and David A. Knickel, "Evaluation of Educational 
Programs: An Exploration." (Paper prepared for Gulf Schools Supplementary 

Education Center, Pearland, Texas, 1966) . 
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says, n It should he clear then, that the ideal of science (research)*" is 
the controlled experiment. . .the controlled experiment is the desired model 
of science (research) . ”3 No particular method or technique represents the 
model of research. The model of research is the process by which a person 
or group of persons use the best skills or means available to gather evi- 
dence to answer their questions. Thus, evaluation and research are similar, 
if not equivalent, processes. 

Assuming, as Kerlinger does, that one particular method or technique 
represents the model of evaluation or research has two subtle but pervasive 
by-products: (a) Every issue or question about education or behavior must 

be tailored to fit that method, (b) When the issues or questions do not 
fit that method, they are not amenable to evaluation or research. This is 
putting the cart before the horse. Tools and methods must be chosen by 
persons to suit their problems and questions. 

Viewing evaluation as equivalent to, or as a special case of, research 
and viewing research as a personal, human process of seeking answers to ques- 
tions, have many implications, some of which will be discussed below. How- 
ever, one distinction must be made clearly at the outset, the distinction 
between "personal" and "private." Research is a personal activity, subject 
to many human foibles, mistakes, and weaknesses. However, research is also 
an explicit, documented process, open to public scrutiny, so that another 
person can independently repeat the process and independently verify or 
refute the resulting statements. A private, undocumented process for gener- 
ating statements is a mysterious one that cannot be independently repeated 
and checked. Such a private process is not research. The issue here is 
the general issue of the grounds for evidence. Whenever someone makes state- 
ments in the evaluation of education, it should be pofssible to assess inde- 
pendently whether his statements are justified. If the process of arriving 
at his statements is private, the accuracy of his statements cannot be 
assessed. 



Methodology in Behavioral Research 



The argument can be taken a step futher by viewing evaluation as a 
special case of research in the social or behavioral sciences. That is, 
evaluation is research on institutions, organizations, large and small 
social groups, and individual abilities and predispositions, i.e., the 



‘-Present author’s insertion. 

3f. N. Kerlinger, Foundation s of Behavioral Research : Educational 

and Psychological Inquiry (New York: Holt, Rinehart and Winston, 1964) , 
p. 291 . 
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intcrpley among physical, economic, and social environment and individual 
behavior and experience. The distinct advantage of viewing evaluation as 
social and behavioral research is that there exists an extensive literature 
on the issues, problems, strategies, techniques, and guidelines of such 
research. This literature is too far ranging to review here; however, 
auchors such as Gage,* Kerlinger (mentioned above), Selltiz et al .,5 Fest- 
inger and Katz, and Webb et al. ' have provided excellent treatments of 
research methodology. A great number of books and articles on educational 
Q nd psychological testing are also available. In other words, the process 
of obtaining data to answer questions, together with the precautions and 
guidelines required for making statements based on the data, have been 
thoroughly described and documented. However, in spite of this extensive 

there remains °ne special problem for education evaluation 
that has not been treated extensively enough and in enough detail. This 
problem has to do with what one is studying and the conditions under which 

one is studying it in the typical, workaday exercise of education evalua- 
tion. 



Two Strategies of Research 

... , 0ne way e yaluate an educational program, or a part of an educa- 

ional program, is to construct a paradigm case and study its outcomes, 
ha is, to conduct an explicitly arranged experiment. This strategy has 
he great advantages of any controlled experiment, such as carefully iso- 
lating and controlling the conditions and it increases the possibility of 

s * ate “f nts on what leads what. Laboratory schools and 
xpenmental educational groups, in which students are carefully selected 
an assigned to groups and in which teaching conditions and environmental 
circumstances can be varied or held constant at will, are examples of this 
evaluative strategy. A futher advantage of this strategy, here called 
experimental evaluation, is that the .rationale and guidelines for it have 
een discussed so carefully by writers such as Campbell and Stanley, 8 



^Nathaniel L. Gage, (ed.), Handbook of Research on Teaching 
(Chicago: Rand McNally, 1963). ~ 



. ^Claire Selltiz, et al. , Research Methods in Social Relations. 

(New York: Holt, Rinehart and Winston. I961J! ’ “ 

^ ^ ^keon Festinger and Daniel Katz, (eds.) Research Methods in the 
Behavioral Sciences (New York: Holt, Rinehart and Winston, 1953)” 



^Eugene J. Webb, et al. , Unobtrusiv e Measures: Nonreactive Researe 

in the Social Sciences (Chicago: Rand McNally, 1966). 

8 Donald T ; Campbell and Julian C. Stanley, "Experimental and Quasi- 
Experimental Designs for Research on Teaching," Handbook of Research on 
Teaching, Nathaniel L. Gage, editor (Chicago: Rand McNally, 1963), pp7”l71 
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Shaw, 9 Festinger, -0 and Kerlinger (mentioned earlier), who offer strong 
and reasonable arguments favoring the experimental approach. 

Granting the advantages and strengths of experimental evaluation, 
the major problem in adopting it as a strategy is that in the usual eval- 
uation of an educational program, whether by a participant in the program 
or by an outside person or agency, that which is being evaluated is an 
already existing, * intact program. By its very nature, the typical, worka- 
day process of evaluation involves what is here called naturalistic eval - 
uation or naturalistic research wherein the evaluator or researcher does 
not have manipulative, experimental control over the events he is studying. 
In contrast to experimental research, there is a relative lack of careful, 
thoughtful literature that discusses the strengths, rationale, and guide- 
lines for naturalistic research. In other words, despite a few exceptions, 



* 




9m. E. Shaw, "Social Psychology and Group Processes," Experimental 
Methods and Instrumentation in Psychology , J. B. Sidowski, editor (New York: 
McGraw-Hill ,""1966), pp. 607-553 • 

l^Leon Festinger, "Laboratory Experiments," Research Methods in the 
Behavioral Sciences , Leon Festinger and Daniel Katz, editors (New York: Holt, 
Rinehart and Winston, 1953 )> PP* 607-643. 

1^-The following paper discusses some of the differences between the 
experimental and naturalistic strategies: 

Edwin P. Willems, "Toward an Explicit Rationale for Naturalistic 
Research Methods." Human Development (in press.). 

l^Roger G. Barker, "Observation of Behavior: Ecological Approaches," 

Journal of Mt . Sinai Hospital , 31 *’268-284, 1964; 

Roger G. Barker, "Explorations in Ecological Psychology," American 
Psychologist , 20:1-14, 1965 ; 

Paul V. Gump and Jacob S. Kounin, "issues Raised by Ecological and 
Classical Research Efforts," Merrill - Palmer Quarterly , 6:145-152, 1959-lS^O; 

Edwin P. Willems, "An Ecological Orientation in X’sychology, " 

Merrill - Palmer Quarterly , 11: 317-343 > 19^5; 

Edwin P. Willems, "Toward an Explicit Rationale for Naturalistic 
Research Methods," Human Development (in press.); 

Edwin P. Willems, and H. L. Raush, (eds.) Naturalistic Viewpoints 
in Psychological Research (New York: Holt, Rinehart and Winston")! (in 
preparation. ) . 
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there is a comparative lack of material dealing with the kind of research 
methods that the typical evaluative process calls for. 

Dimensions of Naturalistic Research 

Because of the relative lack of detailed material on naturalistic 
research, some space will be devoted to the rationale and reasons for such 
methods . 

Inventory of educational phenomena . Handbooks and encyclopedias of 
the physical sciences agree in saying that potassium ranks seventh in order 
of abundance of elements, and constitutes 2.59 percent of the igneous rocks 
of the earth's crust; that potassium compounds are widely distributed in 
the primary rocks, the oceans, the soil, plants, and animals; and that sol- 
uble potassium salts are present in all fertile soils. Such findings on 
the distribution of potassium in nature are extremely helpful to persons 
who are evaluating occurrences of potassium and its compounds and effects. 

In other words, physical scientists and engineers have detailed information 
on the frequency and range of occurrence of the things with which they deal. 
It is odd that apart from the occasional, spotty educational census, the 
norms of a few tests, and some selective service summaries, the engineers, 
planners, and evaluators of educational programs and behavior know so little 
about how their phenomena are distributed in nature. They know relatively 
little about the frequency, range, and classification of teaching styles, 
problem-solving, succeeding, failing, aspiring, cooperating, accepting 
responsibility, playing, being valued, conflict, etc. In other words, in 
the evaluation of programs for optimizing and arranging human resources, 
there is no handbook, inventory; or periodic table which will put specific 
findings into perspective. The process of generating such a handbook of 
education-related behavior calls for a tremendous amount of naturalistic 
research or the observation and classification of intact, naturally-occur- 
ring educational situations. Evaluation, if viewed and conducted as serious 

research, could aid in filling this need. 

Problem of* yield . It is often thought that naturalistic research, 
the kind of research involved in evaluating existing programs, is ineffi- 
cient in that a disproportionately small amount of information is gleaned 
from the investment of time and money. Experimental evaluation is often 
viewed as being more efficient, in that the information yield is greater. 

The nroblem of ratio of effort to yield occurred in a study hy Gump 
and Kounin,^-3 who had been studying how disciplinary techniques directed at 
one child affected the other children in classrooms. Extending their studies 



^3paul V. Gump and Jacob S. Kounin, "issues Raised by Ecological 
and Classical Research Efforts," Merrill -Palmer Quarterly, 6:1^5-152, 
1959-1960. 



to nubile camps, they observed camp leaders and campers. In the written 
narratives describing 46 hours of the behavior of campers, only one incident 
of the type they were studying occurred for every 18 pages of narrative. 

This was extremely low yield from a naturalistic study. However, Kounin 
reported another study in which the yield was disappointingly low from a 
piece of experimental evaluation.! 1 * Kounin was again assessing effects of 
disciplinary technioues upon nontarget children, but this time he wanted to 
assess their affects upon children who ranged from high to low on commit- 
ment to the tasks. Afte r carefully training his experimental teachers, he 
arranged for a sample of adolescents to come to the laboratory school during 
the summer, for pay, to serve as experimental students, while in intact, 
naturally-occuring classrooms, he was able to find such students. The 
implication is that the problem of economy of effort is not unique to nat- 
uralistic evaluation. 

Artificiality. Perhaps more important than the question of yield as 
the liklihood of artificiality in experimental evaluation. Artificiality 
refers to educational arrangements and situations that students and teachers 
never confront in their intact, everyday experiences. Two published studies 
illustrate the problem of artificiality. 

Gumo and Kounin (mentioned above) wanted again to assess the effects 
of disciplinary techniques by teachers upon nontarget students, this time 
in college classrooms. They arranged for student accomplices to come late 
to classes, and they trained teachers to sometimes try supportive, friendly 
reactions and sometimes threatening, punitive reactions on the late arrivals 
In this experimental arrangement, they found that punitive, as compared to 
friendly, reactions resulted in lower ratings of the teachers' competence, 
likableness, and fairness by nontarget students. However, the investigators 
found through later questionnaires that the students were surprised that one 
of their college teachers would make a point of taking time out from class 
to correct and respond to a late arrival. In their attempts to create a 
neat experimental evaluation, Gump and Kounin had taken a phenomenon out of 
its everyday context and produced an arrangement that was not customary for 

the students. 

Gump and Sutton-Smith studied the reactions and behavior of poorly 
skilled boys when they were assigned to more or less difficult roles in 
games 15 In the experimental game, a boy who was It worked in the center 
of a rectangular playing field and attempted to tag other players who ran 
to and from safe areas at each end of the field. High power was given o 



14-jacob S. Kounin, "Dimensions of Adult-Child Relationships in the 
Classroom." Paper read at Topology Meeting, New York, August, 1961. 

15paul V. Gump and B. Button-Smith, "The It Role in Children's 
Games," The Group , 17:3-8, 1955. 
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the It role by having the boy who was It call the turns; he dictated when 
others could run. In the low power condition, the other players could run 
whenever they chose. Preselected slow runners were assigned to the two It 
conditions. In the low power as compared to the high power It conditions, 
poorly skilled boys experienced more tagging failures, uttered more comments 
indicating defeat and distress, and were teased and combined against more 
frequently. However, when the investigators adopted the naturalistic strat- 
egy and observed what happened in gyms, camps, and playgrounds, where they 
had not arranged and manijmlated the same conditions, they found that poorly 
skilled boys seldom got into difficult games. If they did get involved, 
they avoided or were exluded from the difficult roles. The experimental 
evaluation created conditions that were artificial and unusual for the boys. 
The naturalistic evaluations pointed out in whet ways the experiment failed 
to match the everyday event. 



There are several implications of these examples of artificiality. 
There is certainly nothing inherently wrong with experimental evaluations. 
In fact, experiments are often ideal for testing hunches and theories. On 
the other hand, it is clear that naturalistic evaluation is not a weak sec- 
ond cousin of experimental evaluation. In fact, it is the only choice of 
strategies when one wishes to evaluate intact, already existing educational 
arrangements. Finally, as a later section will suggest, naturalistic eval- 
uation can often point out specifically those ways in which experimental 
educational research has generated artificial, restricted conditions. 



Repeatability. Reference was made earlier to a distinction between 
personal and private activities, and the point was made that private activ- 
ities have no place in research or, therefore, in evaluation. One charac- 
teristic of evaluation that takes it out of the private domain and brings 
it into the acceptable research domain is repeatability. An evaluation is 
repeatable when the same evaluator or another evaluator can use the same 
techniques or observe the same conditions a. second or third time and end up 
with the same results. If two persons evaluate the same phenomenon, sup- 
posedly using the same techniques of observation and assessment, and get 
discrepant results, the evaluative processes they have used should be open 
to question as containing too many private, unrepeatable aspects.. Natural- 
istic research, the type typically required for evaluating existing educa- 
tional conditions, is often criticized on the grounds that it is not repeat- 
able. Fortunately, repeatability is not a matter for pure speculation and 
argument to decide; it can be decided on the basis of what persons actually 
find. One example will illustrate the point. 



In the face of a strong movement to consolidate high schools in 
Kansas, a group of psychologists sought to evaluate the effects of school 
size upon the experience and behavior of students. 16 0 ne subpart of this 



l^Roger £. Barker and Paul V. Gump, Big School , Small School : High 

School Size and Student Behavior (Stanford: Stanford “University Press, 196*0- 
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program of research, which is still in progress, sought to assess how school 
size related to the development of sense of obligation to school activities, 
or social responsibility, in the students. Two studies, four years apart, 
involving different samples of students, yielded almost identical results: 
students in small schools reported much more sense of social, obligation 
than students in large schools. Several studies independently obtained 
corroborating results. The point here is that all of these studies involved 
naturalistic methods of evaluation, and they yielded repeatable, nonprivate 
results. 



Generalizing . Much of the preceding discussion is relevant to what 
is perhaps the most important rationale for naturalistic evaluation, or 
naturalistic research. It is probably a truism that persons who are eval- 
uating an educational arrangement wish to make general concluding state- 
ments about the arrangement and perhaps others like it. Several examples 
will illustrate how naturalistic research sometimes disagrees with experi- 
mental research and will suggest how naturalistic evaluation serves a 
uniquely important function. The examples may appear to range far afield 
from direct educational applications, but their implications are rather 
direct and important. 

S. L. Washburn, the anthropologist, reports that baboons have been 
studied frequently in experimental research, as in primate laboratories, 
and in captivity, as in zoos. 17 When two or more baboons are confined 
together, it is common to find that one emerges as the leader, which is not 
surprising in itself. Under these conditions, the leaders emerge and main- 
tain their leadership through physical intimidation and comparative brute 
power. Washburn goes on to report the results of observations of baboons 
in their natural habitats, and one set of findings is of special interest. 
Leaders also emerge in the wild, but they appear not to emerge by physical 
intimidation and brute power. Superior cunning, sexual expertise, and 
attractiveness seem to be the route to leadership among baboons in the wild. 

The work of Beecher on pain relieving drugs also suggest the impor- 
tant contribution of naturalistic research. 1° Beecher's findings suggest 



!7sherwood L. Washburn, Phi Beta Kappa lecture, University of 
Kansas, 19 & 3 * 

^■^The following references summarize some of Beecher's research: 

Henry K. Beecher, "Relationship of Significance of Wound to Pain 
Experienced," Journal of the American Medical Association , l 6 l :l609-l6l3> 
1956 ; 



Henry K. Beecher, "Generalization From Pain of Various Types and 



that laboratory experiments on pain and pain-relieving drugs contribute 
little to the understanding of pain and analgesia as they occur outside the 
laboratory. Compounds that seem to relieve pain in the controlled labora- 
tory conditions often fail to do so in the wards of hospitals, while sub- 
stances which, by laboratory experiment would seem to be ineffective fre- 
quently are high effective in the clinical situation. In other words, 
the picture of drugs and pain relief would be incomplete without natural- 
istic evaluation. 

Hall and Williams, two psychologists, recently reported a fascinating 
study of how groups handle conflict and solve problems. 19 They compared the 
performances of 20 established,, intact groups of management trainees to 20 
contrived groups, groups with no history as groups before the individuals 
came to the laboratory. The established groups differed markedly from the 
contrived groups, expecially in the way they dealt with conflict. In other 
words. Hall and Williams found that it is precarious business to generalize 
from contrived laboratory groups to established, naturally occurring groups. 

The implications of these examples for educational evaluation are 
clear. It is seductively easy to be content with the results obtained in 
controlled experimental research and to assume that caution is the only 
requirement for generalizing these findings and making them applicable to 
the real life situation. The examples suggest that caution is not the only 
requirement, and that naturalistic research performs a uniquely required 
function. Not only is naturalistic research not a second-rate adjunct to 
experimental research, but it is very important in ascertaining the general - 
izability of experimental research. In other words, naturalistic evaluation 
promises to perform an important function in educational research. 

S ummary of naturalistic evaluation . A significant amount of space 
has been devoted to the rationale for naturalistic evaluation, and parts of 
the discussion appear propagandistic . This is so by intent, for two 
reasons. First, evaluation is a special case of research. The kind of 



Origins," Science , 130:267-268, 1959$ 

Henry K. Eeecher, Measurement of Subjective Responses (New York: 
Oxford, 1959 ) > 

Henry K. Eeecher, "increased Stress and Effectiveness of Placebos 
and Active Drugs, " Science , 132:91-92, i960. 

19jay Hall and Martha S. Williams, "A Comparison of Decision-making 
Performances in Established and ad hoc Qroups," Journal of Personality and 
Social Psychology, 3*214-222, 1966^ 



research represented hy the evaluation of intact, already existing situ- 
ations can and should he recognized hy all those who take educational 
research seriously. 



Advantages and Implications 



In behavioral research, the investigator roust have a clear idea of 
what he wants to study. In evaluation, one must have a correspondingly 
clear idea of what one is evaluating. In both research and evaluation, the 
investigator seeks to make statements that document the occurrence of some- 
thing or that document that one thing leads to another. In all these efforts, 
the investigator or evaluator attempts to gather information or data to form 
a basis for his statements. One advantage of viewing evaluation as a spe- 
cial case of research is that other persons engaged in research have learned 
certain critical lessons from the process of gathering data. These lessons 
have been well-documented, and should also be taken into account in evalu- 
ation. 



Reliability. One such important lesson concerns the reliability of 
the ways, means, and techniques used to gather information. Tests, first- 
hand observations, questionnaires, interviews, tabulations from school 
records, and reports of students and teachers are all examples of techniques 
that might be used for gathering evaluative information, and they constitute 
what researchers call measures, or data-gathering techniques. If one admin- 
istered an achievement test twice to the same group of students, under stan- 
dard conditions and found that students’ scores fluctuated widely from one 
time to the next, one should question the ability of the test to measure 
anything. This is one example of lack of reliability. Authors of published 
tests usually study reliability as a matter of course. However, it is easy 
to forget that any technique for gathering information is a measuring instru 
ment whose reliability should be assessed directly. If a technique or mea- 
suring instrument fails the test of reliability, then the answer to the 
critical question, "Does my technique measure anything?" must be case in 
doubt . 



The problem of realiability is not critical in measures of such 
things as weight, height, hair color, and sex, but measures of the kinds 
of things educational evaluation seeks to assess usually involve very compli- 
cated problems of reliability. If a zoologist has difficulty arriving at 
reliable measures of how many chimpanzees cross paths in a small jungle 
area, 20 it seems likely that evaluation of education will have difficulty 



20 aria an Kortlandt, "Chimpanzees in the Wild," Scientific American, 
206: 128-1 40, May 1962. 
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arriving at reliable measures of much phenomena as self-actualization or 
,f a full grasp of the opportunties for preparing for a productive life." 21 
Futhermore, it will not do for persons involved in evaluation to deemphasize 
and criticize those who worry about reliability, because reliability is 
fundamentally involved in what constitutes acceptable evaluative evidence. 

Validity . Reliability focuses on the question, "Do I have a measure 
at all? Assuming reliability, the next critical question is, "Does my 
technique measure what I think it measures?" The latter is a question of 
validity, a question that has been thoroughly probed in the literature on 
research. 22 By way of analogy, suppose one were informed that school A 
had beaten school B by a score of l4 to 10 in football. The scores would 
undoubtedly represent valid measures of the number of times the teams 
scored touchdowns, extra points, and field goals; however, they would prob- 
ably represent increasingly invalid measures of team ability, quality of 
coaching, and quality of football in the regions from which the schools 
came. Likewise, data on how often students are absent are probably a valid 
measure of attendance, but their validity as a measure of student initiative 
or sense of responsibility is much more questionable. 

Validity is directly involved in the grounds or basis for making 
evaluative statements, and the question, "Do his data measure what he thinks 
they measure? probably lies at the heart of many controversies that arise 
over evaluation. 

Con trol groups . Closely related to the problem of validity is the 
importance of control groups, another lesson learned the hard way by persons 
involved in research. Campbell and Stanley, in the paper cited earlier, 
discuss the usefulness and importance of control groups in detail, so that 
only enough space will be devoted to them here to suggest their relevance 
to evaluation. One detailed example will suffice. 

Sense of responsibility, commitment, loyalty, or sense of obligation, 
all refer to something that parents, teachers, and group leaders commonly 
hope children will acquire during their development. Several years ago 
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21 Henry S. Dyer, "The Pennsylvania Plan," Science Education, 
50:242-248, 1966. 



22 Validity is one of the most common topics in the literature on 
educational, psychological, and social research. The following is just 
one example of such discussions: 



Lee J. Conbach, Essentials of Psych ological Testing. 2nd ed. 
(New York: Harper and Row, i960) . 
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some studies were conducted to assess the relation between high school size 
and students’ sense of obligation to school activities. 3 For present purposes, 
the exact details are less important than the findings. Students in small 
high schools reported a strong sense of obligation to activities, muc 
stronger than their counterparts in a large school. However, the small 
schools were located in small rural- towns, while the large school was loca e 
in an urban area, a circumstance that made it impossible to attribute e 
differences in sense of obligation to school size instead of rural-urban 
differences. In this case, another study served the function of a control 
group. ^ The second study compared students in schools that varied in size 
but which were all located in relatively small towns, and obtained the same 
results for school size. Taking into account this control study, the impact 
of school size upon sense of obligation could be evaluated more clearly. 

In other words, rather than controlling anything in the usual sense, control 
groups, when properly chosen and used, allow one to attribute effects where 
they should be attributed, a very important consideration in educational 

evaluation. 

Only three implications, in the form of lessons to be learned, that 
follow from viewing evaluation as a special case of research have been dis- 
cussed in detail. The three areas discussed, reliability, validity, and 
control groups, deal with the process of gathering defensible information, 
and are perhaps the most fundamental. Other lessons learned from research 
also have their implications, and will only be mentioned. Here such spec- 
fie evaluative problems as construction and use of questionnaires and tests, 
proper sampling of circumstances and persons, interview techniques, and 
useful, informative statistical techniques for summarizing and presenting 



23 <rhe following papers summarize the research on sense of obligation: 

Edwin P. Willems, '’Forces Toward Participation in Behavior Settings. 
In Roger G. Barker and Paul V. Gump, Big School , Small School: High Schoa L 

Size and Student Behavior (Stanford: Stanford University Press, 19°^, 

pp. 115-135; 

Edwin P. Willems, "Participation in Behavior Settings in Relation 
to Three Variables: Size of Behavior Settings, Marginality of Persons, and 

Sensitivity to Audiences." Unpublished doctoral dissertation, University 

of Kansas, 19^5; 

Edwin P. Willems, "Sense of Obligation to School Activities in 
Relation to Size of School and Type of Student," Child Development (Paper 
submitted and under review). 

24 w . J. Campbell, "Some Effects of High School Consolidation." 

In Roger G. Barker and Paxil V. Gump, Big School , Smell School: Hi g h Schoo l 

Size and Student Behavior (Stanford: Stanford University Press, 19o*U> 

pp. 139-153. 
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the information gathered in evaluation all have their own unique problems 
and have all been treated extensively in the literature. 



Simplifying in Evaluation 



The reasons for, and advantages and implications of, viewing eval- 
uation as a special case of behavioral research discussed so far focus on 
factors that might usefully be taken into account in the actual exercise 
and activity of evaluating. One of the earmarks of research and, therefore, 
of evaluation, is that evaluative problems be refined, narrowed, and simpli- 
fied into manageable size so that reasonable, defensible information can e 
tethered. No one to date has satisfactorily evaluated education per se; 
every attempt at evaluation selects some aspect or aspects of the educational 
process as being of special interest or particular importance. Sometimes, 
insidiously, the selection is sometimes made according to the kinds of obser- 
vstionaland measuring tools that happen to he available Ho matter what 
the reasons for the selection, the typical approach to educational evaluation 
is a simplifying one, where simplifying means arbitrary untying, dismantling, 
and selection from the complex phenomena of education. The arguments in 
?avor of simplifying are compelling, but simplifying also has major drawbacks 
Just one such drawback, common to research in general and to evaluation in 
particular, is the very strong human tendency toward faddishness. In beha r 
ioral^research and in evaluation it is tempting to invest the piece of work 
one is doing with too much value and representativeness, and assume a^ 
what one is observing and studying represents, or is, education. In the 
overall effort called evaluation, not only is there room for, but there 
should be explicit attention given to widening, complicating, and enrich ig 
the scope and complexity of the set of things and occurrences to be evalu- 
ated. In other words, there is also a need for just the opposite of simpli- 
fying, for at least three reasons. 

The first such reason is the common tendency to develop stereotypes 
about what education is. One such stereotype might view education as the 
circumstances that produce creative problem-solving in students. Stereo- 
types have the advantage of leading to crucial, clear-cut evaluations, for 
example, measures of creative problem-solving on students from different 
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25gee, for example, Handbook of Research on Teaching , edited by 
Gage and mentioned above, and various chapters in. 

Gardner Lindze, (ed.) Handbook of Social Psychology (Reading, 
Massachusetts : Addis on-We si ey , 195*0- 
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schools and classrooms. However, stereotypes can also be translated into 
exclusive educational goals, whereby, for example, one might assume that if 
a little creative problem-solving is good and useful, more should be better. 

In addition to the tendency to stereotype education itself, a second 
reason for guarding against premature simplifying is a common tendency in 
research, called "the law of the hammer. The law of the hammer goes as 
follows: If you give a child a hammer, things to be pounded become the 
most important things around. Likewise, it is too often true that if one 
gives the researcher or educational evaluator a standardized achievement 
test, achievement as that test measures it becomes the most important aspect 
of education to be evaluated. Of course, the same can happen when the eval- 
uator has a pet way of measuring self-actualization, leadership, social 
responsibility, ego-strength, student satisfaction, problem-solving, and 
so on, but the tendency encourages premature simplifying. 

The third reason for avoiding premature simplifying focuses on the 
admirable but misleading little phrase: "Let’s look at the facts." The 

directive to "look at the facts" can often cut away much of the polemic 
and controversy in educational evaluation, but the response to it can also 
be subtly misdirected. As a case in point, the Department of Education of 
the State of Kansas published a comprehensive educational survey of Kansas 
schools in the late 1950 's, and the survey itself stated that any evaluation 
of the recommendations of the survey, without knowledge of the facts on 
which they were based, would be of little value. From the sizes, curricula, 
programs, and administrative organizations of Kansas schools, the survey 
concluded that the schools were suffering from educational malnutrition. 
Unfortunately, the survey did not provide the factual basis for a diagnosis 
of malnutrition, which should have included careful examination of the chil- 
dren who partook of the fare provided. To provide the factual basis, a group 
of psychologists at the University of Kansas undertook a series of studies 
of Kansas schools (see Barker and Gump, mentioned earlier), and found that 
when the experience and performance of the students were considered, some 
of the survey's conclusions were unfounded. These psychologists took ser- 
iously the dictum, "Let's look at the facts," indicated the ways in which 
the picture painted by the survey was prematurely simplified, and strength- 
ened the picture by complicating it. 

A second case in point is the history of attempts in psychology to 
evaluate the effectiveness of various types of psychotherapy. In the early 
1950's, the cry went up, "Let's just look at the facts," and various types 
of psychotherapy were evaluated in terms of simple discharge rates. ^7 











^From a personal conversation with Professor Egon Guba. 1 

J. Eysenck, "The Effects of Psychotherapy: An Evaluation," | 

Journal of Consulting Psychology, 16:319-32^, 19^6. I 
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These early attempts were completely inconclusive, and it is only now, four- 
teen years later, that the complexity of evaluating psychotherapy and guide- 
lines for evaluating it 8re taking useful shape. 28 

All evaluation requires some simplification and choice. However, 
keeping in mind (a) the common tendency to enshroud the concept of education 
in stereotypes, (b) the common tendency to define education by the available 
measuring tools, and (c) how misleading the call to "Look at the facts" 
can be, might prevent premature and overly narrow simplification. 



Evaluation and Everyday Practice 



Ml that has been said so far has been directed toward the implica- 
tions of viewing evaluation as research and the special contributions that 
evaluation can make to educational research in general. Little has been 
said about the relation between evaluation, or the persons engaged in eval- 
uation, and the actual, day-to-day practice of education in the hands of 
school boards, administrators, teachers, and students. In the eye of the 
day-to-day practitioner evaluation, especially when it is done by third per- 
sons and outsiders, is often seen as nosiness, intrusion, criticism, and 
recommended change. Perhaps, viewing evaluation as research will inten- 
sify such feelings. However, advocating a view of evaluation as research 
in no way implies a position that is incompatible with day-to-day practices, 
commitments, and traditional wisdom. 

If over the years many different approaches of the practice of 
education have been tried, if some approaches have worked better than others, 
and if those that have worked better have, to some extent, been more persis- 
tently practiced by their originators, or imitated by others, or taught to 
apprentices, then the practices that have emerged may represent a valuable 
and tested portion of all possible practices. However, the bases for select- 
ing and pruning among practices are very imprecise in the day-to-day pro- 
cess. Sometimes, what survives is a product of chance, or whim, or habit. 
Explicit evaluation, perhaps by an outsider, enters here as a means for 
sharpening the selection process. Explicit evaluation is not a source of 
ideas that are necessarily contradictory with day-to-day, accumulated wisdom, 
but is rather a refining process, or an aid to the refining process, supple- 
mentary to and sometimes superimposed upon accumulated wisdom. Thus, every- 
day accumulated practice and evaluation are both products of accumulation, 
remaining from experiences from which much has been weeded out. Evaluation 
as viewed here is one source of evidence for making decisions, and is not 
itself 8 policy-planning activity. 



J. Kiesler, 
Search for a Paradigm, " 



"Some Myths of Psychotherapy Research and the 
Psychological Bulletin, 65:110-136, 1966. 
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III 

SUBCULTURES OF THE SCHOOL; 

SELF -ACTUALIZATION AND STUDENT EVALUATION 



Charles L. Mulford* 



Two developments have been noted in recent social -psychological 
considerations of the school. First, it is widely recognized that it is 
useful to view the school in terms of its unique culture or ethos. As 
noted by Kallenbach and Hodges,, 1 "There is no doubt that each school has 
a culture of its own. These cultures can range from the comparatively 
homogeneous rural . . . , to the diverse patterns of an urban secondary 
school of three thousand or more students." The relationship between the 
school’s culture, student behavior, and effective evaluation techniques 
will be discussed in this paper. 



Secondly, recent research has suggested that the appropriate unit of 
analysis should be "the-student-in-the-school . " By this it is meant that 
the social needs and satisfactions of students should be considered vis-a- 
vis the requirements of the school as an efficient organization (having a 
formal organization structure, directive leadership, and aspects of mana- 
gerial control). These researchers specifically suggest that the social- 
psychological needs of students may not be congruent with some organizational 
needs and requirements. Student "self-actualization" 2 is a term used to 
indicate the degree to which students feel that their personal needs are 
being met and/or considered by the organization (school) officials. A 
discussion of the impact of the* culture of the school on student behavior 
will be followed by a discussion of the dynamics of student self-actual- 
ization. 



Why is a knowledge of the school's culture(s) or ethos necessary? 
BeckerS suggests that "culture (or reality) shock" may occur if the teacher’s 
expectations of his pupils' behavior do not "fit" closely to what actually 



* Assistant Professor of Sociology, University of Houston. 

lw. Warren Kallenback and Harold M. Hodges, Sr., Education and 
Soci ety (Columbus, Ohio: Charles E. Merrill Book, Inc., 19^3 ) • 

2 Charles L. Mulford, "Self-Actualization in a College Environment,' 
Journal of College Student Personnel (To be printed March 19^7) • 

3Howard S. Becker, "Social-Class Variations in the Teacher-Pupil 
Relationship," Education and Society. Edited by W. Warren Kallenbach and 
Harold M. Hodges, Jr. (Columbus, Ohio: Charles E. Merrill Books, Inc., V 

1963 ). 
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occurs. For example, "teachers find that some of their pupils act in such 
a way as to make themselves unacceptable in terms of the moral values cen- 
tered around health and cleanliness, sex and aggression, ambition and work, 
and relation of age groups." It is not necessary for the teacher to adopt 
the attitudes and patterns of behavior of his pupils, but foreknowledge of 
these (student cultures) will facilitate a person's role as educational 
leader. 



Specifically, how does a student's cultural "heritage" and value 
orientations relate to his school behavior? First, consider the influence 
of the student's social class background. Hollingshead^ reported in his 
study of "Elmstown Youth" that: the lower the social class of the students' 

family (l) the higher the drop-out rate, (2) the lower the vocational aspi- 
ration, ( 3 ) the lower the proportion planning on college, (4) the lower the 
course grades and the higher the failures, ( 5 ) the higher the proportion 
in commercial and vocational curricula, (6) the lower the education of the 
parents, (j) the lower the intelligence scores, and (8) the greater the 
number of recorded discipline problems. Wilson5 adds, "Social class differ- 
ences in educability stem from family differences in stimulation, training, 
and motivation." The home of the lower-class child, he suggests, is (I 
would temper this with "may be") no launching pad for upper-class orbiting. 
But, as Wilson further notes, upper and middle class families sometimes fail 
to reproduce themselves, both culturally and biologically. So, the schools 
must be able to identify talented students from all social classes, train 
them in the necessary skills, values, and attitudes, and send them into 
responsible positions in the community. 

In summary, a student's cultural heritage (often different from his 
teacher’s) preconditions him with regard to attitudes, values, perspectives, 
and knowledge. A teacher's understanding of the pervasiveness of social 
class influences should help to better understand the needs and potentials 
of his students. For example, a teacher should be able to answer these 
questions: 

1. To which social classes do students in his classroom belong? 

2. How does a student's social class influence his preparation 
for and selection of life goals? 

3 . When a teacher's social class heritage is different from that 
of his students, does this present problems? How are these 
problems typically resoJ_ved? 



^Discussed in Everett K. Wilson, Sociology (Homewood, Illinois: 
The Dorsey Press, 1966 ). 
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k. Can teachers identify students who are upwardly mobile, that is, 
those most likely to he selected for social status positions 
higher than those of their parents? What seems to be their 
special needs? 

5- To what extent do teachers realize that students’ behavior, 
achievement, and success may in part be a function of their 
social class heritage? 

Not only does the student’s cultural heritage affect his behavior, 
but in addition, he may also be affected by the dominant values and prac- 
tices among his classmates. Coleman^ suggests that the mediocre student 
rather than the talented is often rewarded. He found that the grades of 
"A" go to the truly exceptional student only where the student population 
itself honors academic achievement. Where the intellectual pursuits are 
not esteemed in the student subculture, students of lesser ability receive 
high grade records. The more talented students may tend to seek social 
status and recognition from their peers, whose value orientation they share, 
in athletics and social popularity. In this same study, Coleman notes that 
these social rewards have very little to do with facilities and equipment. 

But observation tells any experienced teacher that not one but sev- 
eral subcultures or systems of student attitudes and values may exist in a 
school. Newly arrived students may encounter a cultural environment often 
diverse in its manifestations. Clark and Trow suggest that four subcultures 
typically are present: the academic, collegiate, nonconformist, and voca- 

tional subcultures. Clark and Trow's typology of possible student subcul- 
tures is shown below. 7 

Identify with School Involved with Ideas 

Yes No 

Yes Academic Collegiate 

No Nonconformist Vocational 

The collegiate subculture is the world of dates, sports, and fun opportu- 
nities that are possible within and concomitant with school. The student 



6james S. Coleman, "The Adolescent Subculture and Academic Achieve- 
ment," American J ournal of Sociology , (Jan. i960), p. 337 et passim . 

7The author is leaning quite heavily in this section on an adaptation 
of Burton R. Clark's and Martin A. Trow’s "Determinants of College Student 
Subculture" presented in Sociology by Leonard Broom and Phillip Selznick, 
(New York: Harper and Row, 1958), 3rd ed. pp. ^53-^56. 
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who identifies with the school because of these socially rewarding oppor- 
tunities hut does not identify with the world of ideas is said to he a mem- 
ber Oi the collegiate subculture. A second orientation is the vocational 
subculture. In this subworld, students are narrowly concerned with job 
opportunities and preparations. A school pursuit is valued only if it will 
enable the student to "get a job." "Like the collegiate, the vocational 
subculture is resistant to intellectual demands beyond what is required to 
pass the course. "8 



£ third group of students identify with the academic subculture. 
That is, they identify with both the school and world of ideas. These 
serious students identify with the intellectual concerns of the teachers 
but may be regarded as "grinds" by some students. 



A fourth orientation is the nonconformist group, "in the eyes of 
their more conforming classmates they are the unwashed. . . These somewhat 
rebellious students are concerned with the intellectual pursuits but not 
with the school as an institution. It \the subculture^ offers the rebel - 
lious student shelter and intellectual support for his rebellious ideal- 
ism. "9 



It is generally assumed that the relative importance of the colle- 
giate subculture has declined in recent years, during which in the same 
period of time there has been a relative "triumph of vocationalism. " How- 
ever, a recent surveylO suggested that only 19 percent of entering college 
freshmen from 23 different schools regarded themselves as "academic", 4 
percent described themselves as "nonconformist", 27 percent as "vocational' 
and 51 percent described themselves as "collegiate" (fun and games). 



To what extent should a teacher be able to answer these questions 



1. What subcultures exist in his school? How many 
students belong to each subculture? 



2. Which subculture seems to be dominant? 



3. 



How do teachers deal with members of the different 
subcultures; are teaching and counseling primarily 
oriented towards the needs of only one subculture? Which? 



8 Ibid. 



9lbid. 



10 Student Life Division, Faculty and Staff Bulletin, No. 3 (Houston, 
Texas: University of Houston), Nov. 9, 19^. 
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It was noted earlier in this paper that a second major concern has 
focused on whether or not there may he a lack of congruency between the 
social -psychological needs of students (student seilf-actualization) and 
the organizational requirements of the educational system. It is recog- 
nized, of course, that any individual involved in any relationship (includ- 
ing students enrolled in schools) necessarily surrenders some "freedoms” 
and degrees of personal autonomy. A healthy balance between social needs 
and organization requirements is necessary. Researcn indicates that stu- 
dent self-actualization may be significantly related to school academic 
success and achievement. In this study of college students, a test was 
used that has been developed to measure student self-actualization. Scores 
from this test, which were significantly associated with academic success, 
were relatively independent from other traditional variables used in pre- 
diction studies such as sex, social class, high school preparation, and 
so on. 



Are these questions relevant for teachers: 

What are the important predispositions (needs and desires 
"expected" to be met) of students in this school? 

What attitudes do teachers have toward these perceived 
needs and desires? 

How can teachers structure their classroom so that maximum 
self-actualization will result? 

[ It may be that the two concepts, subculture identification and self- 

■ actualization, can be profitably considered as part and parcel of the same 

[ problem. Put the question this way: which students are most likely to 

have their needs met in the classroom? That is, toward which subculture 
| do most teachers direct their best talent and resources? 

5 An almost impossible dilemma seems apparent. Unless all students 

| come to identify with the teacher's orientation, a most unlikely event in 

I most large urban schools, the only alternative may be that teachers will 

' have to realize that different groups of students have different needs. 

| Quoting from the recent survey of college students mentioned above, "The 

; figures cited not only bear out the thesis that a minority of students can 

I be described as primarily academic in orientation, they indicate also 

the strong concern of other students with the emotional aspects of their 
lives. Love and affection, self-respect, and personal identity seem to 
be in the center of their concern. "12 



llMulford, loc. cit. 

l^Student Life Division, loc . cit. 
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In summary , each student (and each teacher) brings with him a cul- 
tural heritage stemming largely from his social class membership. A teach- 
er's understanding of the many implications of social class may" help him 
better meet the needs of his students. In addition to this influence, 
students may relate to another (independent of social class influences) 
cultural reality — one of several possible student subculture orientations, 
each with a different theme of life or ethos. 

To maximize mental health, self-actualization, and objective achieve- 
ment for each student is certainly a worthy goal. If this goal is to be 
met, however, a teacher’s teaching, counselling, advice, and evaluation of 
his students may have to be extremely varied and rich in its content and 
form. These issues call for additional study and research so that the 
teacher may be aided in this endeavor. 
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EVALUATION OF SCHOOL CONTROL RELATIONSHIPS 



Bancroft C. Henderson* 



The act of evaluation is the verbal or symbolic human process of 
applying values to some aspect of a society. Values are the degree of 
preference that the evaluator applies to a facet of a society. All phenom- 
ena for any one time or place. There are conflicting values* unconscious 
values, flexible changing values, ambiguous values, and values associated 
with roles. Values are relative and not absolute; they change with time, 
place, and role. They differ between societies. Within a society they 
differ in relationship to subsystems like the schools or between roles 
such as superintendent and teacher. 

The collective act of administering evaluation, rather than indivi- 
dual evaluation, could serve the purposes of a school system by focusing 
attention on alternatives, by enriching the dialogue concerning school 
policies and their consequence, by increasing the awareness of values 
implicit in all policy, and by making more conscious the needs for research 
in relationship to policies and flexibility in regard to those policies. 

Evaluation of control relationships places an emphasis upon the 
dominant -submissive relationships which are customary in bureaucracies with 
hierarchical structures. Control relations are then those aspects of a 
role which define formally, or informally, the discretion of the role part- 
ners, the degree of obedience or compliance which is expected, and the 
amount of initiation for change. Control is related to written regulation, 
hiring and firing authority, promotion and demotion authority, and other 
forms of sanctions. The relationship is important; it deals with a series 
of important questions. Who makes final policy? How independent should 
professional people be of direct supervision by nonprofessionals? Are 
teachers professional persons? How much responsibility should be left 
with the student and teacher in order to encourage mature self-regulation? 
How much should a school system cut itself off from the student subculture? 
Can democratic values be learned in an authoritarian setting? Should the 
public run our schools? How should the public participate in school policy 
formation? These and other critical questions are related to school control 
and its evaluation. 



♦Associate Professor of Political Science, University of Houston. 
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If the system is defined as the school board and all of the person- 
nel, students, and property of the schools per se, then the outside controls 
come from units of government and from public elections. 

Since outside control factors are also outside the policy discretion 
of the school system, the type of questions considered in evaluation should 
not be entirely focused upon the wisdom of the policy itself, but should 
emphasize the type and style of compliance. For the purpose of evaluation 
of schools this is significant because the list of limits or controls is 
long and dramatic. It includes from many states: 

1. Keeping records of student attendance 

2. Making general reports on students 

3. Checking sight and hearing of students 

b, Making inventories of school property 

5. Using state adopted textbooks 

6. Reading Bible daily without comment 

7. Following established course of study 

8. Supervising fire drills 

9. Teaching health education with reference to 
•use of alcohol and narcotics 

10. Teaching certain ideas basic to good citizenship 

11. Requiring loyalty oath by teachers 

12. Refusing teachers right to wear religious garb 

13. Requiring teachers to attend conferences or institutes 

lb. Requiring teachers to contribute to retirement fund 

15. Reauiring teachers to undergo annual physical health examination 

16. Demanding teachers have no obligation to book companies 

When you add this list to the limits set by taking restraints and 
financial aid as well as certification requirements for individuals and 
accreditation requirements for systems, the authority outside the system 
seems great. However, there are grants of authority to the levels within 
the system that are also extensive. These will be considered at appro- 
priate points later in this paper. 

Evaluation of controls at this point can involve some significant 
questions. How well aware are participants of these controls? Is there 
a handbook that describes the rules and regulations? How much support 
do these limits get from boards, administrators, and students? Do associa- 
tions of boards, supervisors, and teachers express any opinion or have 
any voice in the writing of such policies? Are individual persons encour^ 
aged to participate in the formation of these policies? Are the policies 
obeyed? Are students aware of any noncompliance? These questions, ae 



lleo Martin Chamberlin, The Teacher and School Organization (New 
York: Prentice -Hall, Inc., 195^), pp» ^4-66. 




those included later, are of a "fact" nature and 8re intended j s a basis 
for stimulating evaluation. When these kinds of questions ar j. developed 
into an evaluation format they might cause discussions of significant 
issues. / 

The appropriateness of full participation of professional teachers 
in policies that regulate the profession is one of the most significant 
issues. The doctor dominates policy regarding doctors /as do engineers 
and lawyers for their professions. This does not seem to be the case for 
teachers. 

In another respect there are some significant control issues that 
stand as examples of the heart of the American political and social system. 
The way a system reacts to these issues will demonstrate to the student 
more significantly than any lecture the commitment to values related to 
these issues. Examples of such issues are school racial segregation, public 
aid to church schools, released time for religious instruction, religious 
prayers, flag salute requirements, teacher loyalty oaths, and legislative 
investigation of the beliefs of teachers. 

If a school system vigorously participates in, policies to reduce 
outside control, there could be in a general way an adverse reaction in the 
minds of students as the local school system would be vigorously protecting 
its "right of self-determination." The effect could be the impression of 
narrow provincialism plus hostility to government and politics. This may 
be an unfortunate message to inadvertantly communicate to students and 
the public. This possibility should be evaluated. 

Since politics is conflict and conflict resolution, participation in 
politics is also significant to the resolution of conflicts over educational 
policy. Such conflicts as vocation vs. academic emphasis, much sex educa- 
tion vs. little concern, foreign language offering vs. little attention, 
and even the issue of whether controversy shall be encouraged, tolerated, 
or penalized are vital. Thus while vigorous political participation which 
is aimed at both limiting and affecting outside influence may communicate 
one set of values, nonparticipation would demonstrate political apathy and 
would leave the resolution of educational conflict issues to others who 
need the conceptualization of alternatives that a professional could offer. 
If the ideal is "expert" participation with a basis in professonal recog- 
nition, then the evaluation instrument should focus upon the degree to 
which this is a practice. 



School Board Control 
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There is a broad range of authority that a school hoard may exercise. 
Following are some categories 

1. May regulate dress 

2. May require physicals 

3. May exclude those jeopardizing health of others 

4. May require vaccination 

5. Can expel persons who join fraternities or secret societies^ 

6. Can exclude married students from activities such as football^ 

7- May prescribe required curriculum 

8. May prescribe teacher method 

9 . Can expel a student who breaks any rule 

10. Can expel for misconduct that does not break a rule5 

11. May expel a student without notice of hearing 
(court cases may have modified this point) 
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This list is included to indicate that authority is extensive. The 
limits upon the use of this authority are thus within the system. Since 
the authority over teachers and administrators is also extensive and most 
teachers do not have the right to effective resistance, this issue of 
board control is critical. ° Evaluation may cover some of the following 
questions which could raise the issue of appropriateness in relationship 
to significant values, for what is legal may not be valuable. 

Are hearings and appeals provided in all disciplinary cases involving 
teachers, students, and administrators? Is hiring, promoting, and salary 
on the basis of merit or influence? Are citizenship rights of students 
protected or is in loco p arentis the perspective? How well are principles 
of free speech protected for students and faculty? Are there multiple 
channels of communication to the board, or is the superintendent the only 
channel? Do boards or their members belong to board associations? Are 
board members well-informed? Do they attend meetings regularly? Is repre- 
sentation on the board fairly apportioned in the population? Are school 
board election contests based on significant issues? Are the issues fully 



^Lee 0. Garber, The Law Governing School Board Members and School 
Board Meetings (Danville, Illinois: Interstate Printers and Publishers, 

I9537,' IpTT^B. 

3i8 Ca. App. 19* 
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587 (Ark. ) 254. 

^Garber, op. cit . , No. 3»> P* 6. 
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discussed and understood by the public? Is the board primarily a policy- 
maker and not administrator?? 

Many of these questions deal not just with the nature of control but 
with the style of control and the degree that the rest of the school system 
is likely to respect and accept the controls of the board as being repre- 
sentative of public thinking and based upon adequate information. When 
policies are soundly based, teachers and administrators find it difficult 
to resist them. Some of these types of questions were indicated by research 
that has been conducted concerning superintendents’ expectations of school 
boards. 

If these types of issues are incorporated into an evaluation instru- 
ment, the effectiveness of the board in relation to the system would be 
indicated. 



The Administrative Role and School Control 



Since this paper is short and intended only to be indicative of 
evaluation concerns, this section will combine all administrative levels in 
order to remain at a general level. 

The subject of administration and control is related to the existence 
of a profession of education administration. One characteristic of a pro- 
fession is the degree of autonomy of the administrator and the respect that 
is shown for his expertise. 

Donald E. Tope writes about the development of a profession of edu- 
cation administration. 

Such an establishment must assess the current status 
of the school administration, the standards of selection and 
credentialing, the quality of the preparation programs de- 
signed to equip him for the job, and the social conditions 
necessary for him to perform his task effectively.® 

It would appear from this statement and related remarks that the primary 
social condition necessary would.be a high degree of self regulation, 
selection, and training of the professional by the professional. The 
areas of professional administrative authority deserving evaluation could 



?Keith Goldhammer, The School Board I96U (New York Center for 
Applied Research). 

®Donald E. Tope, The Social Science View of School Administration 
(Englewood, New Jersey: Prentice -Hall, 1965); P* 2* 
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■be: 

1) Degree of autonomy of control and the degree of initiative 
allowed, for the establishment of preservice and inservice 
education for prospective administrators^ 

2) Degree of access to board policymakers 

3) Freedom and encouragement for affiliation with professional 
associations 

4) Establishment of administrative organizational pattern 

5) Setting of job description qualifications. 

The degree to which the appropriate administrative levels dominate these 
policy areas and processes would be a measure of professionalism. An evalu- 
ation instrument could appropriately include this topic. 

Another topic related to administrative evaluation could be the 
degree to which the administrative levels have secured board support. There 
is some evidence that this could be a problem related to policy. As an 
example, school board members and superintendents were asked, "is it desir- 
able or undesirable to give numerical grades on regular report cards in 
the first six grades?” Fifty-three percent of school board members said 
that it was desirable while only 9 percent of superintendents agreed. 

This would indicate that there may not have been an informal discussion 
concerning broad and general problems of interest to the board and the 
superintendent. Such a lack of dialogue could be related to conflict and 
estrangement that might be avoided by a presentation, in an informal set- 
ting, of the arguments and data behind some types of policy. In some cases 
of concern both superintendents and board members might be benefited by 
this dialogue . As an example, 73 percent of the board members and 61 per- 
cent of superintendents believe in the "teaching” of the desirability of the 
American way of life. 11 Such high approval upon such a question may indi- 
cate a lack of concern for the implications of such an approach. More 
frequent discussion of issues initiated by administrators would change 
policy positively. Also the discussion would possibly increase the board 
respect for the professional character of the administrator and his know- 
ledge of the complexity of school operations and policies. 

9lbid. 

lONeal C. Gross, Who Runs Our Schools (New York: Wiley, 1958), p. Il6. 

11 Ibid. 
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Much of the impact of school administration and control is upon the 
classroom teacher. If the teacher perceives the administrator both as 
hostile and as the source of inadequate planning as one older study indi- 
cates, then the reaction of the teacher may be hostile. 12 Turnover rates 
or other objective measures of hostility plus attitude measures would be 
indicators of the adequacy of the administrator-teacher relationship. 

Since the nature and style of control may be the source of the hostility, 
means must be found, to evaluate these relationships. 

Related to this issue of morale and control is a parallel question 
of professionalization of the teacher. The administrator is a participant 
in a hierarchy that will necessarily limit professional autonomy but the 
teacher is not in such a role and may expect the degree of autonomy that 
the role model for the professional receives. The role model may be the 
college teacher. If this is correct then there is a crisis in legitimacy, 
for there is little indication that school administrators see the role of 
the teacher as being much above that of an ordinary employee. One writer 
on this topic said: 

The democratic concept of supervision emphasizes respect 
for personality. The teacher as an individual is to have free- 
dom to think her own thoughts, to exercise initiative, to devel- 
op self-reliance and to assume responsibility for and intelli- 
gent participation in the direction of instruction and the 
determination of policies. 13 

While this statement may not be in itself objectionable it is in 
terms of obligations and not rights or prerogatives indicating authority. 
At another point the same author indicates a closer supervisory relation- 
ship: 



Although there has been much development of inservice 
programs for teacher education in recent years, there is still 
a great need for adequate supervision of the novitiate's efforts, 
and for close and careful suggestion, assistance, and direction 
so that he may develop desirable teaching methods and procedures, 
as well as broaden and increase his academic and professional 
knowledge.!** - 



!^A. S. Barr and Nels 0. Reppes, M The Attitude of Teachers Toward 
Supervisors," Journal of E xperimental Education , U:237-301> June, 193&. 

13charles E. Boardman et al . , Democratic Supervision in Secondary 
Schools (Houghton Mifflin, 1953 )> P* 28. 

l^Ibid. , p. 8. 
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At another point the same author indicates that any delegation of 
authority is provisional. Other writers are even more blunt. One said, 
"It will be delegated by the group to a person or persons and withdrawn if 
not exercised toward the achievement of commonly determined objectives . nl 5 



If the content of a teacher's classroom performance were outside of 
the proper jurisdiction of administrative authority then the extensiveness 
of that authority would be blunted at an appropriate level. There is no 
indication of such a limitation. One writer of a general liberal persuasion 
argues that no teacher should be allowed to make derogatory remarks about 
the United States government .16 



If there exists a conflict between professional and hierarchial 
perspectives on teacher roles or if there is a functional advantage in 
moving toward a professional perspective, then an evaluation instrument 
should assess the issues around this topic. Here are some examples of 
questions that are appropriate at this point: Is the out-of-class time of 

a teacher scheduled by school administrators? Are teachers required to 
participate in extracurricular programs? Is there a check made of lesson 
plans? Do administrators attend a teacher's class without permission? Are 
the class activities of a teacher defended by an administrator as being 
part of his authority? Are teaching methods determined by the individual 
teacher or teacher groups? Can a teacher invite outside speakers without 
permission? What are the tenure protections of teachers? What authority 
does a teacher have over the subject, age level, and time that he or she 
shall teach? Is attendance at meetings and conferences voluntary or com- 
pulsory? Are promotions based upon objective standards? 



These kinds of questions are related to the degree of professionalism. 
There are other subjects that need evaluation in relationship to the teacher 
and controls. One of these is the degree of acceptance by the teacher of 
existing limits. Another is the participation in the making of policy. 

Yet another would be the degree of flexibility of policy itself. One 
last subject would be the informal compliance or noncompliance with policy. 



The Student and Controls 



The theoretically almost absolute and final power of the parent when 
transferred to the school system has left the impression that the issue of 



■^William H. Burton, "A New Definition of the Function of Supervi- 
sion," Educational Methods, 18:6, October, 1938. 



^E. Edmund Ruetter, The Admin ist rator and Subversive Activities, 
(New York: Bureau of Publications’^ 1951 J> P* 105 . 
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authority over the student is settled. Students lose most of their rights 
as citizens when they walk onto school grounds or participate in school 
activity. The issue is not settled and there are arguments for opening 
the issue. One argument is that such extensive control relations may not 
serve educational goals because they would encourage dependency. Also 
there would be a great discontinuity between high school and college where 
citizenship rights are beginning to be protected. Another reason is that 
the courts are gradually curbing some of this school authority. Also it is 
difficult to demonstrate democratic values in an autocratic situation. 
Another argument involves changing family patterns. If there is greater 
independence and self-reliance in the home there may be tension caused by 
a conflict between the home and school role expected of the student. Anoth- 
er factor involves intergenerational conflict. A teacher exercising uni- 
lateral authority may alienate and misunderstand another generation if the 
relationship to that generation is autocratic and thus somewhat disassoci- 
ated. Such alienation would inevitably reduce communication effectiveness. 
For these reasons and others, the argument that is offered here is that 
control relationships with students, involving degrees of discretion, should 
be evaluated. Changes and modification resulting from evaluation might be 
advantageous . 

An evaluation could involve a range of issues. Following are some 
examples of issues: Are there clothing or hair regulations for students? 

Are these regulations written and distributed or are they assumed to be in 
the general authority of the faculty? Do students participate in writing 
the regulations? Are students consulted about regulatory policy? Are 
there appeals from disciplinary action? Are students involved in discipli- 
nary enforcement? At which grade levels? Is the student government given 
any real authority? Is student government sort of a company union, or 
agency of administration, or is it a genuine lobby or policymaker? Are 
student organizations run by students? Do faculty sponsors have a veto 
over funds and activities? Is greater freedom or authority given older stu- 
dents or more advanced students in comparison to younger or less advanced 
students? Are students permitted or encouraged to disagree with teachers 
or administrators? Are reasons given when policy is announced? 

These questions only indicate a few aspects of student control or 
regulation. Each of these deserve attention even if the decision is made 
to retain the status quo. This conclusion stands for the entire paper. 

The whole subject of school control should be a central one for school 
evaluation because it is central to the entire educational mission. 
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